-
Notifications
You must be signed in to change notification settings - Fork 917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adds pylibcudf Scalar #14055
Adds pylibcudf Scalar #14055
Conversation
# Convert the value to a cudf object via pyarrow | ||
arr = pyarrow.lib.array([value.as_py()], type=value.type) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
More effort, but I think we can use the C++-level ArrayBuilder
interface to get an arrow array from which we can make a table to pass to libcudf. If we push it all onto C++ then we can probably avoid some back and forth too
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This (untested) might be a useful starting point:
diff --git a/cpp/src/interop/from_arrow.cu b/cpp/src/interop/from_arrow.cu
index 30cfee97fd..1a1d5a9bc2 100644
--- a/cpp/src/interop/from_arrow.cu
+++ b/cpp/src/interop/from_arrow.cu
@@ -472,4 +472,28 @@ std::unique_ptr<table> from_arrow(arrow::Table const& input_table,
return detail::from_arrow(input_table, cudf::get_default_stream(), mr);
}
+std::unique_ptr<scalar> from_arrow(arrow::Scalar const& scalar, rmm::mr::device_memory_resource* mr)
+{
+ auto b = arrow::MakeBuilder(scalar.type);
+ if (b.ok()) {
+ auto builder = std::move(b.ValueOrDie());
+ auto status = builder->AppendScalar(scalar);
+ if (status != arrow::Status::OK()) { CUDF_FAIL("Arrow ArrayBuilder::AppendScalar failed"); }
+ auto array_result = builder->Finish();
+ if (array_result.ok()) {
+ auto array = array_result.ValueOrDie();
+ auto type = detail::arrow_to_cudf_type(*array->type().get());
+ return detail::get_element(
+ detail::get_column(*array.get(), type, true, cudf::get_default_stream(), mr)->view(),
+ 0,
+ cudf::get_default_stream(),
+ mr);
+ } else {
+ CUDF_FAIL("Arrow ArrayBuilder::Finish failed");
+ }
+ } else {
+ CUDF_FAIL("Arrow MakeBuilder failed");
+ }
+}
+
} // namespace cudf
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was hoping to avoid the C++ API, but at this point I think you're right that the most straightforward solution is to implement at this level. I'll look into it.
7a75d19
to
d6eee4b
Compare
The core of this PR is in #14121, #14124, and #14133. The only remaining bit that's only on this branch is the beginnings of a scatter implementation, which didn't actually get very far. I'll handle that properly in another PR once #14133 is merged, but at this point this PR can be closed since it was primarily to help demonstrate the approach I was taking before I had a chance to really clean up the code. |
Description
Checklist